Symposium on Theoretical Aspects of Computer Science 2010 (Nancy, France), pp. 191-202 
www.stacs-conf.org 



ROBUST FAULT TOLERANT UNCAPACITATED FACILITY LOCATION 



Abstract. In the uncapacitated facility location problem, given a graph, a set of demands 
and opening costs, it is required to find a set of facilities R, so as to minimize the sum 
of the cost of opening the facilities in R and the cost of assigning all node demands to 
open facilities. This paper concerns the robust fault-tolerant version of the uncapacitated 
facility location problem (RFTFL). In this problem, one or more facilities might fail, and 
each demand should be supplied by the closest open facility that did not fail. It is required 
to find a set of facilities R, so as to minimize the sum of the cost of opening the facilities 
in R and the cost of assigning all node demands to open facilities that did not fail, after 
the failure of up to a facilities. We present a polynomial time algorithm that yields a 6.5- 
approximation for this problem with at most one failure and a 1.5 + 7.5a-approximation 
for the problem with at most a > 1 failures. We also show that the RFTFL problem is 
NP-hard even on trees, and even in the case of a single failure. 



Introduction 

The robust fault-tolerant facility location problem 

For a given optimization problem, the robust fault-tolerant version of the problem calls 
for finding a solution that is still valid even when some components of the system fail. 
We consider the robust fault-tolerant version of the uncapacitated facility location (UFL) 
problem. In this problem, given a graph 67, a demand u{v ) for every node v and a cost f(v) 
for opening a facility at v, it is required to find a set of facilities R, so as to minimize the 
sum of the costs of opening the facilities in R and of shipping the demands of each node 
from the nearest open facility (at a cost proportional to the distance). In the robust fault- 
tolerant version of this problem (RFTFL), one or more facilities might fail. Subsequently, 
each demand should be supplied by the closest open facility that did not fail. It is required 
to select a set of facilities R, so as to minimize the sum of the costs of opening the facilities 
in R and the costs of assigning all node demands to open facilities that did not fail, after 
the failure of up to a facilities. We present a polynomial time algorithm that yields a 6.5- 
approximation for this problem with at most one failure and a 1.5 + 7.5a-approximation 
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for the problem with at most a > 1 failures. We also show that the RFTFL problem is 
NP-hard even on trees, and even in the case of a single failure. 

Related Work 

Many papers deal with approximating the UFL problem, cf. [3j 01 O [12j [13]. The 
best approximation ratio known for this problem is 3/2, shown by Byrka in [2]. 

A fault-tolerant version of the facility location problem was first introduced by Jain 
and Vazirani [10J, who gave it an approximation algorithm with ratio dependent on the 
problem parameters. The approximation ratio was later improved by Guha et al. to 2.41 
[8] and then by Swamy and Shmoys to 2.076 [H]. However, the variant of the problem 
studied in these papers is different from the one studied here. In that version, every node j 
is assigned in advance to a number of open facilities, and pays in advance for all of them. 
More explicitly, every node j is assigned to rj open facilities, and its shipping cost is some 
weighted linear combination of the costs of shipping its demand from all the facilities to 
which it is assigned. It is required to find a set of facilities R that minimizes the sum of the 
costs of opening the facilities in R and the sum of costs of shipping the demand of each node 
j from its rj facilities in R. This approach is used to capture the expected cost of supplying 
all clients demand when some of the facilities fail. In contrast, in our definition a node j 
does not pay in advance for shipping its demand from a number of open facilities. Rather, 
it pays only for the cost of shipping its demand from the surviving facility that actually 
supplied its demand. Hence our definition for the fault-tolerant facility location problem 
requires searching for a set of facilities R that minimizes the sum of the costs of opening 
the facilities in R and the costs of assigning the demands of each node to one open facility 
that did not fail, for any failure of up to a facilities. Our approach is used to capture the 
worst case cost of supplying all clients demand when some of the facilities fail. We argue 
that our definition may be more natural in some cases, where after the failure of some 
facilities, each demand should still be supplied by a single supplier, preferably the closest 
surviving open facility, and each client should pay only for the cost of shipping its demand 
from that surviving facility, and not for all the other (possibly failed) facilities to which it 
was assigned originally. On the technical level, the approach taken in [HJ \TU[ [T4"] is based on 
applying randomized rounding techniques and primal-dual methods to the corresponding 
integer linear program. This approach does not readily apply to our version of the problem, 
and we use a direct combinatorial algorithmic approach instead. 

Two other closely related types of problems are the 2-stage stochastic and robust opti- 
mization problems (cf. 0(6]). Both of these models involve two decision stages. In the first 
stage, some facilities may be purchased. This stage is followed by some scenario depending 
on the specifics of the problem at hand (in a facility location problem for example, the 
scenario may specify the clients and their corresponding demands). Subsequently, a second 
stage is entered, in which it is allowed to purchase additional facilities (whose cost might 
be much higher than in the first stage). In stochastic optimization there is a distribution 
over all possible scenarios and the goal is to minimize the expected total cost. In robust 
optimization the goal is to minimize the cost of the first stage plus the cost of the worst case 
scenario in the second stage. In contrast with these two models, in our variant the facilities 
must be selected and opened in advance, and these advance decisions must be adequate 
under all possible future scenarios. 
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Billionnet and Costa PQ showed a polynomial time algorithm for solving the ordinary 
(non-fault-tolerant) UFL problem on trees. In contrast, we show that the fault-tolerant 
variant RFTFL is NP-hard on trees. 

1. Preliminaries 

Let us start with common notation to be used later on. Consider an optimization 
problem II over a universe V, which given an instance /, requires finding a solution consisting 
of a set of elements R C V. Denote by Cn{L, R) the cost of the solution R on the instance 
/ of II. Let Rji(I) denote the optimal solution to the problem IT on instance I, and let 
C^(J) = Cn(I,Rn(I)) be the cost of the optimal solution. We denote our algorithm for 
each problem II studied later by An(L). The solution returned by the algorithm is referred 
to as Ru 9 (I) and its cost is C^ 9 (I) = C n (I , R^ 9 '(/)) . 

Let us now define the uncapacitated facility location (UFL ) problem. Let / = {G, I, f,uj) 
be an instance of the problem, where G = (V,E) is a graph with vertex set V = {1, ...,re} 
and edge set E. Each node v G V hosts a client in need of service, and may host a facility, 
providing service to clients in nearby nodes. Each edge e G E has a positive length /(e). 
The distance d(u, v) between two points u and v on G is defined to be the length of the 
shortest path between them, where the length of a path is the sum of the lengths of its 
edges. For each node v, let f(v) denote the opening cost associated with placing a facility 
at v, and let u>(v) denote the demand of the node v. The shipping cost of assigning the 
demand u{u) of a client u to an open facility v is the product SC UjV = u(u)d(u,v). The 
shipping cost SC U) r from a set of open facilities R to a node u is the minimum cost of 
assigning u to a server in R, namely, SC Uj r = imn{SC UjV | v G R}. Defining the distance 
d(v, R) between a set of points R and a point v on G to be the minimum distance between 
v and any node in R, i.e., d(v,R) = min re # d(v , r), we also have SC v .r = uj(v)d(v, R). 

It is required to find a subset R C V that minimizes the sum of costs of opening the 
facilities in R and the shipping costs from R to all other nodes. This problem can be 
formulated as searching for a subset R C {1, that minimizes the cost function 

C UFL (I,R) = C facil (I,R) + C ship (I,R), (1.1) 

where 

n n 

C f acii(I,R) = J2f(r) and C ship (I,R) = J2 SC ^R = E ^ ' d ^ R ^ 

rG-R u=l u=l 

Given a set R of open facilities and a facility r € R, let <p(I, r, R) denote the set of clients 
that are served by r under R, i.e., <p(I,r,R) = {u \ d(v,r) < d(v,r') for every r' G R}, or 
in other words, the nodes u that satisfy d(u, R) = d(u, r), where ties are broken arbitrarily, 
i.e., if there is more than one open facility r such that d(u,R) = d(u,r), then just choose 
one open facility r that satisfies d(u, R) = d(u, r) and add u to tp(I, r, R). (When the set R 
is clear from the context we omit it and write simply <p(I, r), or even (p(r) when the instance 
/ is clear as well.) 

The robust fault-tolerant facility location (RFTFL) problem is defined as follows. Each 
client is supplied by the nearest open facility, and in case this facility fails - it is supplied 
by the next nearest open facility. We would like to find a solution that is tolerant against a 
failure of one node. This problem can be formulated as searching for a subset R C {1, n} 
that minimizes the cost function 
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Crftfl(I, R) = Cf aci i(I, R) + C ship (I, R) + C backup (I, R), (1.2) 
where Cf ac u(I, R) and C s hi p (I, R) are denned as above and 



Cbacku P (I, R) = max < 

reR 



«,{v)-(d(v,R\{r})-d{v,r))\ . (1.3) 



Note that 



C RFTFL (I,R) = C facU (I,R)+m^{C ship (I,R\{r})} 

r€R 



C fac u(I, R) + max <j SC VyR \ {r} 



,v=l 



= C faca (I,R) + ^^u(v)-d(v,R\{r})j . (1.4) 

Again, when the instance / is clear from the context we omit it and write simply Crftfl(R), 
Cfacii(R), C s hip(R), Cb ac k up (R), etc. 

We also consider the robust a- fault-tolerant facility location (ot-RFTFL) problem, for 
integer a > 1, where the solution should be resilient against a failure of up to a nodes. We 
define the a_RFTFL as follows. Each client is supplied by the nearest open facility which 
did not fail. We are looking for a subset R C {1, ...,n} that minimizes the cost function 

C a _ R FTFL(I,R) = C facil (I,R)+ max \ f2u(v)-d(v,R\R')\ . (1.5) 



2. A constant approximation algorithm for RFTFL 

2.1. The concentrated backup problem and its approximation 

Towards developing a constant ratio approximation algorithm for RFTFL, we first 
consider a different problem, named concentrated backup (conc-bu), defined as follows. An 
instance of the problem consists of a pair (I, R\) where / = (G, l,f,w) is defined as before 
and i?i = {r\, r^} is a set of nodes. In this version, the nodes of R\ act as both clients 
and servers (with open facilities), and all other nodes v £ R\ have zero demands. Informally, 
it is assumed that we have already paid for opening the facilities in Ri, and each r € R\ 
serves itself, at zero shipping cost. The problem requires to assign each client r G R\ to a 
backup server ti/r, which may be either some server in R\ or a new node from V\i?i. For 
a set of nodes R2, define the backup cost 

C bu (I,R 1 ,R 2 ) = max{SC r i?lUR2 \{ r i} = max {u(r)d(r, Ri Ui? 2 \M)}- 

reRi rGRi 

We are looking for a set R2 minimizing 

C C oncJbu(I, Rl, R2) = Cfacil{R2) + Cfeu (Rl , R2) ■ (2-1) 

We denote this minimum cost by C* onc bu (I-, R\)- We show a 2-approximation algorithm for 
the concentrated backup problem. 
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The problems studied in this section and in section 13,11 are closely related to those 
considered in and to solve them we use methods similar to the ones presented in 

Let us consider a simpler variant of the backup problem, named the bounded backup (bb) 
problem, which is defined on (I,Ri,M) and requires looking for a solution R2 minimizing 

Cbb(I, Ri, M, R 2 ) = Cf aci i{R2) 

subject to the constraint Cb u (Ri, R2) < M, for integer M. We now present a relaxation 
algorithm that finds a set R2 satisfying Cf ac u(R2) < C bb (R±, M) but obeying only the 
relaxed constraint Cb u (Ri, R2) < 2M instead Cb u (Ri, R2) < M. 



Algorithm A bb (I, R x , M) 

(1) < 9 <- 

(2) For i = 1 to k do: 

• Si <(— {v I uj{ri)d{v,r,i) < 2M}\{rj} /* "relaxed" backup servers for 

7 

• If Si n (Ri U Rfo 9 ) = then add to R^ 9 the node v in Si with the 
minimum facility cost f(v). 

(3) Return R b 9 . 



Let us now prove the properties of algorithm Abb- For every € Ri let the set of 
feasible backup servers be Tj = {v \ uj(ri)d(v,ri) < M}\{rj}. Let the set of relaxed backup 
servers selected by the algorithm (namely, the final set R bb 9 it returns) be R?l 9 (Ri,M) = 

{q^ 9 , q a J 9 }- Let £j be the phase in which the algorithm adds the new facility q^ 9 to R bb 9 , 
for 1 < j < J. 

Lemma 2.1. T u n T t . = for 1 < i, j < J. 

Proof: Assume otherwise, and let v £ flTf for some 1 < i,j < J, i 7^ j. Assume 
without loss of generality that w(r^) < w(r£.). Since uj(r£.)d(v,re ) < M, necessarily 
Lo(r£.)d(v , r# ) < M as well, and by the definition of T^., also u}(r£.)d(v,r£.) < M, hence 

w(r^)d(r4,r^.) < ui(r ei )(d(v, r^) + d(v, r t] )) < 2M, 

implying that 77. € <5^ Pi R%, so the algorithm should not have opened a new facility in 
phase £i, contradiction. | 

Lemma 2.2. Cf aci i (R^ 9 (R\ , M) ) < C* b (R h M). 

Proof: Notice that there must be at least one node from every Tj in the optimal solution 
RtJRi, M). By Lemma l2.1l the sets T^ 1 , ...,T£j are disjoint, so there are at least J distinct 
nodes q* € Rl b (R\, M), one from each T^., for 1 < j < J. In each phase i, the algorithm 

selects the cheapest node in Si D Tj. Therefore, f{q alg ) < f(qf) for every 1 < j < J. Hence 
C faca {Rt 9 {Ri,M)) = t f{qf 9 ) < t f(q*) < C; b (Rt,M). | 

Lemma 2.3. C&„(i?i, Rf b {Ri, M)) < 2M. 
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Proof: For each server r j in R\ , the algorithm ensures that there is at least one open facility 
from the set S h so u(n)d(ri, Ri U R bb 9 (Ri,M) \ {n}) < 2M. | 

Now we present an approximation algorithm A conc _b u for the concentrated backup prob- 
lem using the relaxation algorithm Abb for the bounded backup problem. First note that 
there can be at most nk possible values for the shipping costs SC U>V = ui(u)d(u,v). 

Algorithm A conc _ bu (I, Rx) 

(1) For every M € {SC U)V \ u,v € V} do: 

. let Rf b 9 (R\, M) '<- Abb(I, Ri, M). 

(2) Return the set R bb {Ri,M) with the minimum cost 

CconcJbu (Rl j Rbb ' ^0 ) ' 



Lemma 2.4. C^ cJm (^i) < 20^(1, i^). 
Proof: Recall that, letting = R* cmc _ bu {Ri) , 

C*onc-bu(l \ Rl) = C conc _b u (I, R\, R%) = Cf ac u(R2) + Cbu{I, Rl, R^)- 

Let it £ be the node that attains the maximum shipping cost SC U)RiU r 2 \i u \, i.e., satisfies 
uj(u)d(u,Ri U R%\{u}) = Cb„(/,i?i,i?2), and let u G i?i U R^Mu} be its backup, i.e., the 
closest node to u. Then C* concJm (I, R{) = C conc . 6n (I, Ri,R* 2 ) = C facU (R* 2 )+SC u v . Since the 
algorithm examines all possible values of M, it tests also Mq = SC UtV . For this value, the 
returned set R^ 9 {R\,Mq) has opening cost at most C bb (R\, Mq) = Cf ac u(R2) and backup 
cost at most 

C bu {I,Ri,Rtb{Ri,M )) < 2M by Lemmas O and E3 S ince the algorithm takes the 
minimum cost C conc j m {Ri,R b d9 {Ri,M)) over all possible values of M, the resulting cost 

satisfies Cf 9 ncJm {I,Ri) < C f aci i{R* 2 ) + 2SC u , v < 2C* oncJbu (I, Ri), namely, an approximation 
ratio of 2. | 



2.2. 6.5-approximation algorithm for RFTFL 

We now present a polynomial time algorithm Arftfl that yields 6.5-approximation 
for the robust fault-tolerant uncapacitated facility location problem RFTFL. Consider an 
instance I = (G, I, f,oj) of the problem. The algorithm consists of three stages. 
Stage 1: Apply the 1.5-approximation algorithm of [2j to the original UFL problem in 
order to find an initial subset R\ of servers. Notice that the cost of this solution satisfies 

Cufl(Ri) < l.SC% FL < 1.5C* RFTFL . (2.2) 

Each node is now assigned to a server in R\. Next, we need to assign to each node a backup 
server which will serve it in case its original server fails. 

Stage 2: Transform the given instance I = (V,l,uj,f) of the problem into an instance 
I' = (V, 1,uj', f) as follows. First, change the facility cost / by setting f'(r) = for r € R\. 
Next, for each server r G R%, relocate all the demands of the nodes that are served by r, 
and place them at the server r itself, that is, set 

oj'ir) = < ve<p(l,r,Ri) (2.3) 
{ 0, for r £ Ri. 
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Stage 3: Invoke the 2-approximation algorithm A conc j, u for the concentrated backup prob- 
lem on the new instance I' and the set i?i . The approximation algorithm returns a new set 
i?2- We then return the set R± U R2 as the final set of open facilities. 

Lemma 2.5. For every instance I and set R\ C V, C* onc _ bu {I' , R\) < C* RFTFL {I) + 
Cufl(I, Ri)- 

Proof: Consider some vertex r E R\ and let ip(I,r,Ri) = {v\,...,v r k } be the nodes it 
serves. Consider the optimal solution Rrftfl(I) t° the RFTFL problem. Let d\ be the 
distance from r to v\ for 1 < i < k r , and also let x\ be the distance from v\ to its optimal 
backup server, which is also its distance to R* = R\ U R RFTFL (I)\{r}, i.e., x\ = d(v\, R*). 
By the triangle inequality, d(r, R*) < d\ + x\, for every 1 < i < k r , so 

u\r)-d{r,R*) = ^^n-^^^E^nw+^n 

1=1 1=1 



= E^n^r^i) + E^ r K 
1=1 1=1 

n n 



v=l v=l 



Therefore, 



CUI',Ri,Rrftfl(I)) = max{Lj'(r)-d(r,R* r )} 

reRi 

< CshiJl.Ri} + max { • d(v,R*) }. 

reRl \ ti J 
Using (ll.4p and (12. ID we now bound the cost of the optimal solution for problem concjbu 
by 

CconcJbu (J' < C conc _b„ (/' , Ri , R*RFTFL {I) ) 

= Cf aci i(l', Rrftfl^ 1 )) + Cbu(I', Ri 1 Rrftfl i-0) 

< C fac a(l',R RFTFL (I))+max\S2u(v)d(v,R;)\ + C ship (I,Ri) 

reRl [tl J 

< C* RFTFL {I) + C ship (I, Ri) < C RFTFL (I,Ri) + Cufl(I,Ri)- I 

Lemma 2.6. For every instance I and sets R\,R2 C V, 
Crftfl(I, Ri U R2) < Cufl{I,R\) + C conc _b u (I' , R\, R2)- 

Proof: The cost of opening the facilities in Ri DR2 is clearly at most the cost of opening the 
facilities in R\ plus the cost of opening the facilities in R2. For every facility r E R\ U R2, 
in order to bound C s hi p (I, R\ U R2 \ {r}), note that one can first move each client v to its 
closest open facility in Ri, and then move all the clients assigned to r (if r E to the 
backup facility of r in R2. The inequality follows. More formally we have the following. 
Recall that by ([Oil . 

CrftflV, Ri U R2) = C facil (I, Rx U R 2 ) + max { C ship (I, i?i U R 2 \ {r})} . 

rgi(iUi?2 
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Consider first the case that max {C s hi v (I, Ri U R 2 \ W})} is attained for some r' £ R 2 . 

reRiUR 2 F 

In this case, we get by that 

C RFTFL (I,R 1 UR 2 ) = C facil (I,R 1 UR 2 ) + C ship (I,R 1 UR 2 \{r'}) 

< Cfacil(I, Rl U R2) + C s hip(I, Rl) 

= Cufl{I, Rl) + Cf aci i(I, R 2 ) < CuFl(I, Rl) + Ccancjyu^ , R%, R 2 ) 

So now assume that max {C s hi p (I, R\U R 2 \ {r})} is attained for some r' & R±. There- 

re-RiUJ?2 

fore, 

C RFT fl(I,RiUR 2 ) = C facil (I,RiUR 2 ) + C ship (I,RiUR 2 \{r'}) 

n 

= Cf ac u(I, R\) + Cfacil(I, R 2 ) + ^2 SC v> R 1 \jR 2 

v=l 

+ Yl u J (v)-(d(v,R 1 UR 2 \{r'})-d(v,r')) 
veip(I,r',RiUR2) 

< CuFL(I,Rl) + C fac u(I,R 2 ) 



+ max < 

reRi 



k veip(I,r,Ri) 



= C UFL (I,R 1 ) + C facil (I,R 2 )+m a x{w'(r) ■ (d(r, R x U R 2 \{r}))} 

= Cjjfl(I,Ri) + C conc j, u (I ,Ri,R 2 ). I 

Lemma 2.7. Algorithm Arftfl yields a 6. 5- approximation for the RFTFL problem. 

Proof: Consider the set of opened facilities R\\JR 2 . By Lemma l2.44 R 2 is a 2-approximation 
of the concentrated backup problem on the instance I', so 

C C oncJbu(l' , Rl, R2) < 2C* onc _ bu (/', Rl). 

By Lemma ESI C* concJm {I' , R x ) < C RFTFL (I) + C UFL (I, Rt), hence 

Cconc_bu{I ,Rl,Rl) < ^C* RFTFL (I) + 2CuFL(I,Rl)- 

Using Lemma |2. 61 we get 

Crftfl(I,RiUR2) < 3C UFL (I,R 1 ) + 2C RFTFL (I), 
and by <^M, C RFTFL (I , Ri U R 2 ) < 6.5C* RFTFL (I). | 

3. An approximation algorithm for a_RFTFL 

3.1. The concentrated a_backup problem 

As in the case of a single failure, we first consider a different problem, named concen- 
trated a-backup (conc-ajbu), defined as follows. An instance of the problem consists of a 
pair {I, Ri) where / = (G, /,/, uj) is defined as before and Ri is a set of nodes. The nodes 
of Ri act as both clients and servers (with open facilities), and all other nodes v ^ Ri have 
zero demands. We are looking for a set R 2 minimizing 

(I,Ri,R 2 ) — Cf aci i(R 2 ) + C a _b u (I, Ri, R 2 ), (3.1) 
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where C a _b u is the maximum a -backup cost for a set of nodes R2, defined as 

C a _bu(I, Ri,Ba) = max \ ^ w ( r ) " d ( r > R i u R *\ F ) \ • 

We will shortly present a 3a-approximation algorithm for the concentrated a-backup 
problem. 

Towards this, let us first consider a simpler variant of the backup problem, named the 
a-bounded backup (aJob) problem, which is defined on (I,Ri,M) and requires looking for 
a solution R2 minimizing 

C a _bb{Ri,M,R2) = Cf ac u{R.2) 
subject to the constraint Cn g ht_ a _bu(Ri, R2) < M for some integer M, where 
QightjuJruiRi, R2) = max {u(r)d(r,RiU R 2 \F)}. 

r£Ri,\F\<a 

We now present a relaxation algorithm that finds a set R2 satisfying Cf ac u{R2) < 
C*_ bb (R\, M) but allowing the relaxed constraint Cn g ht.ajm(Ri, R2) < 3M instead of 
Clight ja jm(Ri, R2) < 



Algorithm A a _ bb (I, Ri,M) 

(1) <- 

(2) Let ri, rfc be the servers in i?i sorted by nonincreasing order of demands. 

(3) Z ^— /* The set of servers rj where the algorithm opens facilities in phase 

i*/ 

(4) For i = 1 to A; do: 

(5) • Si <- {u I w(ri)d(«, r 4 ) < 2M}\{n}. 

• r< <- I u}(ri)d(v,n) < M}\{r t } 

• If Si n Z = then: 

- Add to the a - |T< n U <_ 9 J| nodes in U R%) 

with the lowest facility costs. 
-Z^ZU{ ri } 

(6) Return ift. 



Let us now prove the properties of Alg. A a _bb- Let {lj | 1 < j < J} be the phases in 
which the algorithm adds new facilities to R^ 9 bb - By a proof similar to that of Lemma 12.11 
we have the following. 

Lemma 3.1. T l% nT £j = for 1 < j < i < J. 
Lemma 3.2. C facU {Ri%{Ri,M)) < C* a _ hb (R x , M). 

Proof: There must be at least a nodes in every T £ . in the optimal solution R* a bb (R\,M). 
By Lemma 13. II the sets T £j for 1 < j < J are disjoint, so the only nodes that the algorithm 

adds to R^ 9 bb from the set T £ . are added at phase ij. The algorithm selects the cheapest 

nodes in T £ . in order to complete to a nodes. Therefore, C f ac n{R a ^ 9 bb {Ri,M) n T £ .) < 
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Cfacu{R* a _ bb {Ri,M) n T t .) for every 1 < j < J. Hence 

J J 

c facU (R a X{R^ M )) = E^(<;(^i-M)n^.) < Y J Cfacii(R* a _ bb (Ri,M)nT £] ) 

i=i 3=1 
< CT^RuM). I 

Lemma 3.3. C a j m {R\, R^ bb (Ri, M)) < 3M . 

Proof: For each server Vi G Ri, the algorithm ensures that either there are at least a open 
facilities from the set Tj or Vi is at distance at most 2M from another Vj G i?i that has a 
open facilities from the set Tj. In the first case the distance is at most M and in the second 
- at most 3M. | 

Now we present an approximation algorithm A conc _ a _ bu for the concentrated a_backup 
problem, using the relaxation algorithm A a _ bb for the a_bounded backup problem. 



Algorithm A conc _ a _ bu (I , 

(1) For every subset T C {SC V u \ v, u G V} such that \T\ < a do: 

• M(T) «- Ylm^T m 

• let R a aJbb( R i> M ( T )) ^A aJbb (I,Ri,M(T)). 

(2) Return the set R a ^ bb (Ri,M(T)) with the minimum cost 

CconcaJm 

(R^R^R^MiT))). 



Lemma 3.4. Cf 9 nc _ a _ bu (I, R,) < 3aC* conc _ a _ bu (l , R t ) . 

Proof: Denote the optimal solution for concha Jbu on (I, Ri) by R 2 = R*onc.a.bu(^i) • Then 

C cemc-CLjnJJ- ■> Rl) = C 'conc_aJm{l i R\i R2) = C fadliRv) + C a _ bu {I, R\ , -fi^)- 

Let Uj} C i?i and Uj} C ^ U for some j < a be the sets of 

nodes that attain the maximum shipping cost, i.e., satisfy C a _ bu (I, R\, R 2 ) = Mo for 

M = E SC Ui>Vi = Ew(«i)4ni,»i). Then C c * onc Q bu (I, = C facil (R* 2 ) + M . Notice 
i=l i=l 

that there must be at least a nodes in the set R 2 U i?i at distance at most Mo from 
every server r in Clearly Cf ac u(R* abb (Ri,Mo)) < Cf ac u(R^). Since the algorithm 
examines all possible values of M(T), it tests also Mo- For this value, the returned set 
R^ g bb (Rii M ) has opening cost at most C* 66 (i?i, Mo) < Cf ac u(R 2 ) and backup cost at 
most C aJm (I,#i,#S 6 (-Ri,M )) < 3M by Lemmas O and E31 Since the algorithm takes 
the minimum cost C conc _ a _ bu (Ri, R^ g bb (I, R\, M(T))) over all possible subsets T, the result- 
ing cost is at most 

C conc.a.bu i 1 ' R l) ^ C conc _ a _ bu (I, R\, R^ bb (Ri , M ) ) 



< 



C f acu{Rl) + max I V u(r)d(r, R 1 U R^ b (Ri, M )\F) 

\F\<a I ^- ■" ' 

1 re(Fnfli) 



< C facil (R* 2 ) + 3aM < 3aC* oncjaJm (I,Rx). | 
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3.2. (1.5 + 7.5a)-approximation algorithm to the a_RFTFL 

We now present a polynomial time algorithm named A a _RFTFLi yielding a (1.5 + 
7.5a)-approximation for the robust fault-tolerant uncapacitated facility location prob- 
lem a_RFTFL against a failure of a nodes, for constant a > 1. Consider an instance 
/ = (G,l,f,io) of the problem. The algorithm is similar to Algorithm RFTFL, except for 
the third stage. Instead of invoking the 2-approximation algorithm A conc j> u for the concen- 
trated backup problem on the new instance I' and the set R\ , invoke the 3a-approximation 
algorithm A C onc_a_bu for the concentrated a_backup problem on the new instance I' and the 
set R\. Algorithm ^4 CO nc_a_bu returns a new set R^ 9 ■ Algorithm A a _RFTFL now returns the 
set i?i U R^ 9 . Proof of the following lemma is deferred to the full paper. 

Lemma 3.5. Algorithm A a _RFTFL yields a (1.5 + 7 .5a) -approximation for the a_RFTFL 
problem. 

4. Robust Fault-tolerant uncapacitated facility location on trees 

In this section we show that the RFTFL problem is NP-hard even on trees. The claim 
holds even in the case where only the edge lengths or only the node demands are variable 
and the other parameters are uniform. An instance of the RFTFL problem is (T, I, f, uj, P), 
where T is a tree, I, f and uj are defined as before and P is an integer. It is required to 
decide if the cost of the optimal solution to the RFTFL problem on the instance (T, l,f,co) 
is P or less. 

The proofs, via reductions from subset sum and from a variant of the partition problem, 
are deferred to the full paper. The following results are established. 

Theorem 4.1. RFTFL on trees is NP-complete even with 

(1) unit edge lengths and opening costs (but variable node demands), 

(2) unit node demands and opening costs (but variable edge lengths). 
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